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Abstract 

Bit-flipping (BF) decoding of low-density parity-check codes is of low complexity but gives inferior 
performance in general. To improve performance and provide new BF decoder options for complexity- 
performance tradeoffs, we propose new designs for the flipping function (FF), the flipped bit selection 
(FBS) rule and the checksum weight updating schedule. The new FF adjusts the checksum weights 
in every iteration while our FBS rules take more information into account. These two modifications 
represent efforts to track more closely the evolutions of both check and variable nodes’ reliabilities. 
Two selective update schedules are proposed to offer more performance and complexity tradeoffs. 

The combinations of the new FBS rule and known FFs result in new BF decoders with improved 
performance and a modest complexity increase. On the other hand, combining the new FF and FBS rule 
gives a new decoder with performance comparable to that of the normalized min-sum algorithm while 
if we use a much simpler FBS rule instead, the decoder suffers little performance loss with reduced 
complexity. We also present a simple decision-theoretical argument to justify the new checksum weight 
formula and a time-expanded factor graph model to explain the proposed selective weight-updating 
schedules. 
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I. Introduction 

Low-density parity-check (LDPC) codes have been shown to asymptotically give near-capacity 
performance when the sum-product algorithm (SPA) is used for decoding [H]. Gallager proposed 
two alternatives that use only hard-decision bits [|3. These so-called bit-flipping (BF) algorithms 
flip one or a group of bits based on the values of the flipping functions (FFs) computed in each 
iteration. The FF associated with a variable node (VN) is a reliability metric of the corresponding 
bit decision and depends on the binary-valued checksums of the VN’s connected check nodes 
(CNs). Although BF algorithms are much simpler than the SPA, their performance is far from 
optimal. To reduce the performance gap, many variants of Gallager’s BF algorithms have been 
proposed. Most of them tried to improve the VN’s reliability metric (the FF) and/or the method 
of selecting the flipped bits, achieving different degrees of bit error rate (BER) and convergence 
rate performance enhancements at the cost of higher complexity. 

The class of weighted bit-flipping (WBF) algorithms [l3]|-|I71| assign proper weights to the binary 
checksums. Each weight can be regarded as a reliability metric on the corresponding checksum 
and is a function of the associated soft received channel values. Another approach called gradient 
descent bit-flipping (GDBF) algorithm was proposed by Wadayama et al. dH. Instead of using 
a weighted checksum based FF, the GDBF algorithm derives its FF by computing the gradient 
of a nonlinear objective function which is equivalent to the log-likelihood function of the bit 
decisions with checksum constraints. It was shown that the GDBF algorithm outperforms most 
known WBF algorithms when the VN degree is small. Sundararajan et al. [fTTlI modified this 
FF by introducing a weighting on syndrome and a zero-mean Gaussian perturbation term. The 
resulting noisy GDBF (NGDBF) algorithm improves the performance of the GDBF algorithm 
which is further enhanced by adding a re-decoding process lU^ . 

For the WBF algorithms, the weights are decided by the soft received channel values and 
remain unchanged throughout the decoding process. Since the weights reflect the decoder’s 
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belief on the ehecksums which, in turn, depend on those of the associated VNs’ FF and bit 
decisions, the associated checksum weights should be updated accordingly. In lfT9l . a reliability- 
based schedule is used in the initial decoding iteration to forward only reliable VN and CN’s 
messages. Nguyen and Vasic lf20l employed an extra bit to adaptively represent the reliabilities of 
VN and CN messages and developed a class of two-bit BF algorithms along with algorithm(s)- 
selection procedures. In this paper, we present dynamic weighted BF (DWBF) algorithms that 
assign dynamic checksum weights which are updated according to a nonlinear function of 
the associated VNs’ FF values. As we shall show, the nonlinear function has the effects of 
quarantining unreliable checksums (which is similar to the method used in the first decoding 
iteration of [HH) while dynamically adjusts the more reliable checksums’ weights. A simple 
decision theoretical interpretation is given to explain the effect of the nonlinear action and 
justify the threshold selection. We also suggest two selective weight-updating schedules which 
offer additional performance-complexity trade-offs. A time-expanded factor graph model is used 
to illustrate the weight-updating schedules. 

The single-bit BF algorithms flip only the least reliable bit thus result in slow convergence 
rates. For this reason, many a multiple-flipped-bit selection rule was suggested ||9||- |[T^ . By 
simultaneously flipping the selected bits, a BF decoder can offer rapid convergence but, some¬ 
times, at the expense of performance loss. A bit selection rule may consist of simple threshold 
comparisons or include a number of steps involving different metrics. It is usually designed 
assuming a specific FF is used and may not be suitable when a different FF or metric is involved. 
Moreover, the FF value may not provide sufficient information for making a tentative bit decision, 
we propose a new flipped bit selection (FBS) rule that takes into account both the FF value and 
other information from related CNs. 

FF, checksum weight computing, weight-updating schedules, and FBS rule are major con¬ 
stituent parts of a DWBF decoder. Our proposals on these parts offer a variety of new design 
options and tradeoffs. The efficiencies of using the proposed schemes jointly or separately with 
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existing designs are evaluated by examining the corresponding numerical error rate and conver¬ 
gence behaviors. We show that our single-bit DWBF algorithm provides significant performance 
improvement over the existing single-bit GDBF and WBF algorithms. Our FBS rule works 
very well with different FFs and outperforms known FBS rules. Moreover, the selective weight¬ 
updating schedules suffer little performance degradation while offer significant complexity re¬ 
duction when the CN and VN degrees are small. 

Note that since the checksum weights are crucial parameters of a DWBF decoder’s FF and 
their updates depend in turn on the FF values computed at the previous iteration, we henceforth 
mean both the FF and the associated weight computing formula whenever FF is mentioned. 

The rest of this paper is organized as follows. In Section |nl we define the basic system 
parameters and give a brief overview of various BF decoding algorithms, their FFs and FBS 
rules. In Section |nll we introduce a new FF and its checksum weight-updating formula. A simple 
decision theoretical justification is given. We consider single-bit BF decoders and present two 
weight-updating schedules as well as their graphic models in Section |IVl The performance of the 
our single-bit DWBF algorithm and some known single-bit BF algorithms are compared in the 
same section. We develop a new multi-bit FBS rule and present the error rate and convergence 
behaviors of various multi-bit BF decoder structures based on the new FBS rule in Section |Vl 
These decoders’ complexities are analyzed in details for evaluating various performance and 
complexity tradeoffs. Finally, conclusion remarks are drawn in Section IVll 

II. Background and Related Works 
A. Notations and the Basic Algorithm 

We denote by {N, K)(dv, dc) a regular binary LDPC code C with VN degree d^ and CN 
degree dc, i.e., C is the null space of an M x parity check matrix H = [Hmn] which has 
dv I’s in each column and dc I’s in each row. Let it be a codeword of C and assume that 
the BPSK modulation is used so that a codeword u = - ,M 7 v-i), Ui G {0,1}, is 
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mapped into a bipolar sequence x = (xq, Xi, ■ ■ ■ , xjv^i) = (l — 2uo, l — 2ui, ■ ■ ■ , l — 2ujv_i) for 
transmission. The equivalent baseband transmission channel is a binary-input Gaussian-output 
channel characterized by additive zero-mean white Gaussian noise with two-sided power spectral 
density of iVo/2 W/Hz. Let y = ,|/Ar-i) be the sequence of soft channel values 

obtained at the receiver’s coherent matched filter output. The sequence z = (zq, zi, - ■ ■ , zjv-i), 
where Zi E {0,1}, is obtained by taking hard-decision on each components of y. Let ii = 
{uq, Ml, • • • , mat-i) be the tentative decoded binary sequence at the end of a BF decoding iteration. 
We compute the syndrome (checksum) vector s = (sq, Si, • • • , sm-i) by s = ft ■ iT^(mod 2). 
We further denote the nth VN by Vn, the set of indices of its connecting CNs by M.{n), and the 
set of indices of the VNs checked by the mth CN Cm by AA(m). The indices of CNs in M.{n) 
are determined by the nonzero elements of the nth column of H whereas those in Af(m) are 
by the mth row of H. 


A generic BF decoding algorithm can be described by Algorithm [T] below which involves 
four important parameters: I, the iteration number; Zmax, the maximum iteration number; En, the 
FF; and B, the index set of the flipped bits, or the flipped bit (FB) set for short. This algorithm 
performs two basic tasks: 1) computing EnS (Step 2) and 2) generating the FB set B (Step 3). 
Most earlier works focused on improving either 1) or 2). An FF, sometimes referred to as cost 
function or inversion function 0, is used as a reliability metric on a VN’s tentative decision. 
Given the FF values and the FBS rule, we select a set of VNs and flip the corresponding tentative 
decisions (bits). Choosing the most unreliable bits or the bits whose FF values exceed a threshold 
are the two most popular rules. For the former rule, usually only one bit is flipped if the soft- 
valued channel information is employed in the FF, resulting in slow convergence. By contrast, 
the latter rule often gives faster convergence rate but possibly at the cost of performance loss. 
We briefly review the known FFs and FBS rules in the following paragraphs. 
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Algorithm 1 Bit-Flipping Decoding Algorithms 

Initialization Set I = 0 and u = z. 

Step 1 V m e Om — {0,1,..., M — 1}, compute 

Sm ^ ^ 2 ). ( 1 ) 

nGAf{m) 

If s = 0 or / = /max, Stop dccoding and output ii; otherwise, / <(— / -f 1. 

Step 2 V n G = {0,1,..., — 1}, compute the FF E^. 

Step 3 Use the FFs obtained in Step 2 to update the flipped bit set B. 

Step 4 Flip Un for all n G S and go to Step 1. 


B. Flipping Functions of BF Decoding Algorithms 

Gallager proposed that a simple sum of binary checksums be used as the FF |[2l 

En = - Y. (l-2s^)- (2) 

dH) implies that the FF value is inversely proportional to the bit decision reliability as it is an 
increasing function of the number of nonzero checksums (i.e., unsatisfied check nodes, UCNs). 

By taking into account soft-valued channel information and assigning checksum weights, later 
modifications of Gallager’s FF can be described by the following general formula 

Bn 0^1 • fifn^ijn) ^ ^ ^mnif (3) 

where Oi\ > 0, (p{un, Vn) is a reliability metric involving channel value and/or bit decision, and, 
to be consistent with Q, Wmn > 0. 

For the WBF algorithm flUl, (f){un,yn) = 0 and Wmn is 

Wmn= ,min ||/^/|, (4) 

n GAA(m) 

The modified WBF (MWBF) algorithm |I51 has f{un,yn) = \yn\ while the improved MWBF 
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(IMWBF) algorithm [|^ uses the same 0(Mn, Vn) but replaees the cheeksum weight by 


Wmn = , min 

n €Af{m)\n 


( 5 ) 


for the message passed from Cm to Vn should exelude that originated from For the reliability 
ratio based WBF (RRWBF) algorithm 0, (p{un,yn) = 0 and 


w. 


= ^/^mn = P 


\yri 


lLU.QiX.n'(^J\f(^m) \yn 


-1 


( 6 ) 


where f3 is the normalizing faetor to ensure that "^'mn = 

The GDBF algorithm of Wadayama et al. |l9l| applies the gradient descent method to minimize 

iV-l M-l 

E{u) = - ^ - 2un) - (1 - 2sm) ( 7 ) 

n=0 m=0 


with respect to (1 — 2 m„) and obtains the FF 


En = -ynO- - ‘^Un) - ^ {l-2Sm), ( 8 ) 

mdAdin) 

which is equivalent to assigning ai = 1, (t){un, yn) = 2/n(l — 2Mn), and Wmn = 1 in ([3]). Recently, 
Sundararajan et g/. lfTTll introduced the so-called noisy GDBF (NGDBF) algorithm based on 


En = -ynO- - - W ^ (1 - 2Sm) + (9) 

meAt(n) 

where g„’s in dH) are i.i.d. zero-mean Gaussian random perturbation with a signal-to-noise ratio 
(SNR) dependent variance and te is a constant syndrome weight. 

For the above two FFs, dU) and dH), (j){un,yn) = 2/n(l — 2nn) is equal to \yn\ when the bit 
decision Un is the same as the |/„-based hard decision, Zn', otherwise, its value is the same as 
— \yn\- This is consistent with the intuition that iin = Zn implies that Un is likely to be correct and 
since —En is a VN reliability metric, a positive yn) helps increasing —En and preventing 
the Un from being flipped. In contrast, (j){un,yn) is always positive (\yn\) in MWBF algorithms, 
which means that the MWBF algorithms tend to trust Un in spite of other evidence. 
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C. Flipped Bit Selection Rules 

For the algorithms mentioned in the Seetion III-Bl only the bit(s) related to the VN having the 
largest FF value En is (are) flipped in eaeh iteration, i.e., the FB set is 


B = {n\n = argmaxE'j}. 


( 10 ) 


As mentioned before, \B\ = 1 and the eorresponding eonvergenee is often very slow if En has 
a soft-valued information term. 

Flipping several bits in eaeh iteration simultaneously ean improve the eonvergenee speed. The 
simplest FBS rule for multi-bit BF deeoding uses the FB set 


B = {n\En > A}, 


( 11 ) 


where the threshold A ean be a eonstant or be adaptive. The optimal adaptive threshold was 
derived by Gallager |l2l, assuming that no eyele appears in the eode graph. Sinee praetieal finite- 
length LDPC eodes usually have eyeles and the optimal thresholds ean only be found through 
time-eonsuming simulations, two ad-hoe methods whieh automatieally adjust A were suggested 
in [fT^ and [QAII . In the adaptive threshold BF (ATBF) algorithm ifT^ . the initial A is found 
by simulation and subsequent thresholds are a monotonieally non-inereasing funetion of the 
deeoding iterations. The adaptive MWBF (AMWBF) algorithm lfT3]| adjusts the threshold by 



( 12 ) 


where E* = maXnEn and ich(s) is the Hamming weight of the syndrome veetor s. 

Sometimes, a tentative deeoded veetor ii may reappear several times during the deeoding 
proeess and form a deeoding loop. This may be eaused, for example, by the event that an even 
number of bits assoeiated with a CN are flipped, leading to an unehanged eheeksum and then 
oseillating bit deeisions. To eliminate the oeeurrenee of loops, the AMWBF algorithm ineludes 
the loop deteetion seheme of |[8l in its FBS rule so that if a loop is deteeted, the most reliable 
bit in B is removed. The parallel weighted BF (PWBF) algorithm IfTOll tries to reduee the loop 
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occurrence probability by having every UCN (s^ = 1) send a constant flip signal (FS) to its 
least reliable linked VN (based on the FF of the IMWBF algorithm) and flips the bits in 

B = > Afs}, (13) 

where Aps is a constant optimized by simulations, 

Fn ^ ^ Qmn^m') (14) 

meAt(n) 

and qmn is given by 

{ 1, n = arg max E^/ 

( 15 ) 

0, otherwise 

Since the above remedy ean only eliminate loops with a eertain probability, the improved PWBF 
(IPWBF) algorithm employs the loop deteetion seheme of |[8| and when a loop is deteeted, it 
removes the bit(s) reeeiving the smallest from B. This algorithm also adds a bootstrapping 
step and a delay-handling proeedure to further improve the bit seleetion aeeuraey but aehieves 
limited improvement for the eodes with high eolumn degrees sueh as Euelidean geometry (EG) 
EDPC eodes. 

A hybrid GDBE (HGDBE) algorithm was proposed in ||9l. In this algorithm, single- and 
multi-bit BE deeoding is performed alternatively and an escape process is used for preventing 
the deeoding proeess from being trapped in loeal minima/loops. Two extensions of the HGDBE 
algorithm were eonsidered in ifTdll and ifTSll whieh require less eomplexity at the expense of 
inferior performanee. A multi-bit GDBE algorithm with a probabilistie EBS rule was also 
suggested in ifT^ for hard-decision decoding. With the FF dH) and FBS rule of ifTSll . the multi¬ 
bit NGDBF (M-NGDBF) algorithm ifTTll aehieves the same BER performanee as that of the 
HGDBE deeoder with mueh less deeoding iterations. However, ifT^ found that new trapping set 
eonditions may exist in the M-NGDBE deeoder but ean be eliminated by re-deeoding with a 
different perturbation sequenee. 
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III. Checksum Reliability and Dynamic Weights 
A. BF Decoding and Checksum Weights 

In line with the belief propagation (BP) based SPA, En is similar to the total log-likelihood 
ratio (LLR) of Vn and —— 2sm) in (HI) is analogous to the belief sent to Vn by Cm- 
Unlike SPA, however, for (111)-(I^, the latter remains unehanged unless an Un, n G 
has been flipped, whieh leads to just a sign ehange of the belief. The general FF format, ([3]), 
ineludes two major terms that represent the deeoder’s eonfidenee on a VN’s tentative deeision 
based respeetively on its ehannel value (or the eorrelation of the ehannel value and tentative 
deeision) and the reliabilities of the related eheeksums. Sinee the ehannel values remain fixed, the 
eheeksums should be given adjusted weights at least in the later iterations when the reliabilities 
of eheeksums ehange. 

Although the flipping operation ehanges the reliability metrie of Un and the related eheeksums, 
all the FFs used in known BF deeoders use statie Wmn thereby ean neither refleet the dynamie of 
VNs’ message passing nor offer self-adjustment eapability in aeeurately updating bit reliability 
information. We present dynamie weight generation method in this seetion. 


B. Flipping Function and Decision Reliability 

The review on BF algorithms in Seetion HI] indieates elearly that the FF value is a proper 
explieit or implieit reliability metrie of a VN’s deeision. As a eheeksum in turn is a funetion of 
the assoeiated VNs’ deeisions, the eorresponding eheeksum weights should be updated aeeording 
to the eurrent FF values. A reasonable eandidate eheeksum weight is therefore given by 


^(0 


^ min -E^^}, 

n €Af{m)\n 


(16) 


where En'^ is the FF value of Vn in the /th iteration. However, (IT^ may result in negative weights 
whieh is ineonsistent with (O and (|3]); both are inereasing funetions of the nonzero eheeksum 
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number. To have proper positive checksum weights based on En, we consider the likelihood 
ratio 


A(^n) = 


f{En\Ho) 


(17) 


where Hq and Hi denote the hypotheses that = Un and Un ^ Un, respectively. The conditional 
probability density function (pdf), f{En\Hi), is the pdf of those EnS associated with a correct 
or incorrect tentative bit decision at a given iteration. It is to be interpreted as a conditional pdf 
averaged over all VNs. The basic decision theory tells us that the optimal decision rule is given 
by 


where tt. 


E{E, 


iij 


^0 7ri((goi - C 

^ 7ro(Cio — Coo) 


( 18 ) 


Pr{Hi is true) and Cij is the “cost” for accepting Hi while Hj is true. Unlike 


the conventional Bayesian minimizing error probability setting, both the costs and the a priori 


probabilities are difficult to assess. For a BF decoder, a tentative decision, except for the initial 


iteration, is determined by the previous decision and the flipping decision. On the other hand, 
both conditional pdfs in (fTTI) depend on the definition of En and the FBS rule. They vary 


from one iteration to another. Although the dependence between and En is implicit as other 


parameters are also intertwined and the conditional pdfs are thus difficult if not impossible to 


derive analytically, they can be estimated numerically by simulations. Assuming that the all¬ 
zero codeword is transmitted in every frame, i.e., = 0 is the correct decision, we depict 

in Fig. [H the evolutions of both simulated conditional pdfs with all related factors considered 
and averaged-for the IMWBF algorithm with En being the updated FF values at, say the /th 
iteration, after the tentative decision at the (/ — l)th iteration Un was made. This figure indicates 
that En does reflect the correctness of the corresponding tentative decision in that a smaller 


(negative) FF value is more likely to be associated with a correct decision. The probability of 


correct inference on Un based on En depends on the separation (distance) between two pdfs. 
The figure, however, demonstrates that the separation does not improve as the decoding iteration 
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Fig. 1: Conditional FF distributions for the IMWBF algorithm in decoding MacKay (816,272)(4,6) LDPC code (816.1A4.845 
(ID), SNR {_Eb/No)= 4 dB. 

increases. We propose a new FF in the next subseetion and show in Seetion |IV] that this new 
FF is eapable of overeoming the shortcoming of the IMWBF algorithm. 

C. A New Flipping Function 

Although TTj and Cij are diffieult to assess but the optimal Bayesian test, (fTSl) . ean be simplified 
to a threshold test based on En- Given a suitable threshold rj. Fig. [T] indicates that if —En > rj, 
the deeoder is more likely to have made a eorreet bit deeision iin whose reliability is proportional 
to —En', otherwise, the deeision is probably ineorreet. 

The foregoing discussion and the aim to have a weight-updating rule reflecting a more aeeurate 
relation among bit deeision, FF, and cheeksums, as elaborated in more details below, suggest 
that we modify (fT^ as 



(19) 
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fl{x) = 


( 20 ) 


where 

X — rj, X > 7] 

0, X < rj 

The elipping operator, ^l{x), besides ensuring only positive weights are used, ean be interpreted 
as the deeision for a CN to send no message to other linked VNs when the associated FF 
values fail to exceed the threshold, which bears the flavor of “stop-and-go” algorithms that pass 
a CN-to-VN message only if it is deemed reliable. Note that a checksum Sm is determined 
by dc bit decisions, and if = 0 and there is only one unreliable decision < rj) 

among VNs in Af{m), the checksum is likely to be valid and the decision is in fact correct. 
Hence should modify En to increase A(i?„) but not pass the message — 2sm) to 

other connected (reliable) VNs (n' G Af{m) \ n). In doing so, En has a local (among Af{m)) 
maximum FF decrease and the probability of reversing the bit decision is reduced. On the other 
hand, if Sm = 1, Wn is likely to be only local incorrect decision, the above rule will result in a 
local maximum FF increase and thus a higher probability of being flipped. When more than one 
—En, n E J\r{m) are clipped, no message is sent from Cm as the checksum itself is unreliable. 
The temporary suspension of some message propagation induced by ^l{x) also has the desired 
effect of containing the damage an incorrect message may have done and preventing the decoder 
from being trapped in a local minimum. The above discourse confirms that (l20l) does fulfill the 
goal that the weight updating should have FFs, checksums, and bit decisions join a cohesive 
effort in improving the performance of a BF decoder. 

The FF defined by (|3]) using the recursive weight-updating rule (fT9l) tends to make the check 
reliability part of the FF, — X]mGAr(n)“ 2sm), starts to grow exponentially after most 
of the correctable bits were flipped and the number of UCNs decreases to just a few. It is 
conceivable that these reliability estimates should be given different weight with more remote 
estimates having less weights. This can be done by having the check reliabilities multiplied by 
a forgetting factor, 0 < 0^2 < 1, as can be found in many recursive adaptive filters [|22ll . We 
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are unable to determine the optimal elipping threshold rj sinee a elosed-form expressions for 
Fr{En\Hi) are praetieally unobtainable for reasons mentioned before. Some simulation efforts, 
however, indieate that 77 is elose to 0 for several BF deeoders, independent of SNR and the 
iteration of interest. With the above ideas in mind, we eonsider a new FF based on (fT9l) and 
(I 20 I) using 7] = 0: 


= -2/n(l - 2Mn) “ ^2 “ 2S™), (21) 

mgA4(n) 

where 0 < 02 < 1 is a positive damping (forgetting) eonstant to be optimized by numerieal 
experiments. 


IV. New Single-Bit BF Decoding Algorithms 

We define a (eheeksum) weight-updating sehedule as a rule that seleets a subset Q of Om, 
whieh represents the set of CN indiees, and updates only those rmn,m G G- Sueh a rule 
determines the message-passing paths in the deeoding proeess (see Fig. |2] below) henee is 
ealled a sehedule. When the updated CN index set G = Om, we eall the sehedule as the 
full weight-updating schedule (FWUS). Alternate sehedules with G 7 ^ Om provide trade-offs 
between eomputational eomplexity and error-rate performanee. In this seetion, we introduee a 
elass of single-bit dynamic weighted BF deeoding methods based on (fT^ . (|2T]) . and different 
weight-updating sehedules. 


A. Single-Bit DWBF Decoding and Weight-Updating Schedules 

Combining (fT9l) and (|2T]) with the eonsideration of the ehoiee of a weight-updating sehedule, 
we obtain the elass of DWBF algorithms or, for simplieity. Algorithm |2l 
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Algorithm 2 DWBF Decoding Algorithm 

Initialization Set / = 0, ti = 2 , and Q = B = ^. Initialize r^li by (15]) for all n G AA(m), m G 
Om- Set En'^ = —r/n(l — 2 m„) for all n G On ■ 

Step 1 Compute for all m G Om- If s = 0 or / = /^ax, stop decoding and output tit; otherwise, 
/ ^ / + 1 . 

Step 2 V n G Oat, compute En^ by ([21]). 

Step 3 Update B and Vn G B, flip and En^ < - En\ 

Step 4 Update Q. Then, update rmn by (fT9l) V n G Af{m),m G Q and set rmn ^ 
y n E m G Om\Q and go to Step 1. 


Algorithm 2 describes a general class of DWBF algorithms. For hard-decision decoding, 
—r/„(l — 2uri) in (l2TI) is replaced by —(1 — 2zn){l — 2m„) and r^n is initialized as 1. The FB 
set B in Step 3 is determined by the FBS rule used, it can be (fTO . Algorithm 3 or 4 presented 
in Section |V| When (fT^ is used. Algorithm 2 is a single-bit DWBF algorithm and becomes 
a multi-bit DWBF algorithm if Algorithm 3 or 4 is used as the FBS rule. In this section, we 
consider only the FBS rule (fTOl) . which implies that only one bit is flipped (i.e., \B\ = 1) at each 
iteration unless it is used for hard-decision decoding. Hence the resulting decoder is referred to 
as the single-bit dynamic weighted BF (S-DWBF) decoder. 

As most FF values will change because of the recursive nature of (fT9l) and (|2TI) we may need 
to perform the FWUS in Step 4. This is one of the prices we have to pay when the dynamic 
weights instead of the conventional constant weights are assigned to the checksums. We call 
the FWUS-based single-bit decoding algorithm as the S-DWBF-F algorithm for simplicity. To 
lessen the computing load of this algorithm, we reduce the size of Q by prioritizing the CNs 
and update only those with a higher priority. 

We first notice that, to ensure that the newest updated information be broadcasted, the weights 
of the flipped bits’ linked checksums should have the highest updating priority. Furthermore, 
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fth Iteration (/+! )th Iteration (/+2)th Iteration 


(a) Selective weight-updating schedule A. 



/ih Iteration (/+l)lh Iteration (/+2)th Iteration 


(b) Selective weight-updating schedule B. 


Fig. 2: The TEFGs for two different selective weight-updating schedules. 


for those VNs whose FF values ehange from one side of the elipping threshold 77 of (l20l) to the 
Other side and undergo a reliability inversion, their related eheeksum weights should be renewed 
as well. With these eonsiderations, the selective weight-updating schedule A (SWUS-A) updates 
only those eheeksums (CNs) whose indices lie in 

— {mlm e e B} 

U{m|m G A4(n), (—— p) < 0, n = 0,1,..., — 1}. (22) 


The time-expanded factor graph (TEFG) shown in Fig. |2(a)| is a simple example illustrating 
how SWUS-A behaves, assuming that the only VN which generates = {1,2} is U 4 . We 
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TABLE I; Complexity of weight-updating schedules 



SWUS-A 

SWUS-B 

FWUS 

Number of Visited CNs 

dv 

min{(i„(d„ — l){dc — 1) -I- 2d„, M} 

M 

Number of Visited VNs 

min{d„(dc — 1) -|- 1, N} 

min{d„(d„ — l)(dc — 1)^ + 2[d„(dc — 1) -b 1], 

N 


denote this VN by •, the CNs visited (seleeted) by the schedule by ■, and the VNs which 
receive new CN messages by ©. 

When SWUS-A is used in the S-DWBF algorithm, the resulting algorithm is called the S- 
DWBF-A algorithm which at the /th iteration updates the CN index set Q with . 

Since only a few VNs received updated messages from the selected CNs, some are likely 
to remain constant for many iterations (e.g., U 3 ) or even during the whole decoding process. To 
spread the updated messages to more VNs, we expand the updated CN set to include both 
and 

— {mlm G A4(n), n 6 uf’} (23) 

where Uf’ = {n\n e AA(m), m G and call this updating schedule as the selective 

weight-updating schedule B (SWUS-B). In other words, the updated messages received by the 
VNs connected to the CNs in will also be forwarded to their connecting CNs in the following 
iteration, i.e., G ^ G'f^ Fig. | 2 (b)| illustrates the expanded updating range by indicating the 

extra visited CNs (those whose indices blong to Gb^) with the symbol 0 . Similarly, when this 
weight-updating schedule is used, we call the resulting algorithm as the S-DWBF-B algorithm. 

Table U lists the average number of visited CNs (to update checksum weights) and VNs 
(to compute FF) per iteration per flipped bit or FF inversion for the different weight updating 
schedules. As expected, the SWUS-A/B need much less computational complexity and this 
reduction is more impressive when dc and d^ are small. Note that for the expanded schedule 
SWUS-B, the VNs in may be linked to a common set of CNs and G^^ ^Gb^ can be nonempty 
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if the code graph has short cycles. As a result, the practical average numbers of the visited CNs 
and VNs for SWUS-B is much less than those shown in Table U which assumes a cycle-free 
code. However, it can be shown that when the code girth is larger than 8 (10), the actual average 
visited CN (VN) number is equal to that of a cycle-free code. 

B. Performance of S-DWBF Decoding Algorithms 

We apply the known single-bit BF algorithms and S-DWBF-A/B/F algorithms to decode two 
regular LDPC codes and present their performance in Figs. [3]and|ll The first code, MacKay’s 
(816,272)(4,6) rate-0.333 LDPC code (816.1A4.845 IfUl l. is a typical low-rate, low-degree 
Gallager code with no special structure. In contrast, the second code, the (1023, 781) (32,32) rate- 
0.763 EG-LDPC code, is a high-rate, high-degree code whose performance has been evaluated 
in several WBF-related works m, a, Eoi, and uni. For convenience, we refer to the above 
low- and high-rate codes as Code 1 and 2, respectively. It has been shown by simulations 
[[3l that BF decoding is very effective in decoding (high-rate, high-degree) EG-LDPC codes. 
It would be interesting to examine the BP algorithms in decoding low-rate, low-degree codes. 
The performance of the normalized min-sum (NMS) algorithm [|23]| is also given for reference 
purpose. 

Pig. [3] shows the BER performance of Code 1 with = 150. Por the S-DWBP-A, S- 
DWBP-B, and S-DWBP-P decoders, the numerically-optimized a 2 values are 0.68, 0.44, and 
0.35 whereas for the IMWBP decoder, we found ai = 0.2. Our extensive simulation concluded 
that the optimal reliability threshold r] in (l20l) is close to 0, whence rj = 0 is used for all DWBP 
algorithms. At BER=10“®, we observe that the S-DWBP-B and S-DWBP-P algorithms have 2.5 
dB and 2.6 dB gains against the RRWBP algorithm; the simple S-DWBP-A algorithm achieves a 
much smaller 0.7 dB gain as it limit its weight update to a very limited range. The performance 
of Code 2 with I max = 50 is shown in Pig. HI Unlike Code 1, Code 2 has a much higher d^, and 
the check reliability part of the GDBP algorithm’s PP, — thus dominates the 
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Fig. 3: BER performance of several single-bit BF (S-BF) decoders as a function of SNR (Et/No) for Code 1. 



E/N^ (dB) 


Fig. 4: BER performance of several S-BF decoder as a function of SNR for Code 2. 


FF value after a few iterations and its performanee is similar to that of Gallager’s BF algorithm, 
espeeially when SNR is high. To improve its performanee we insert a damping faetor so that 
dH]) beeomes 


En 2/n(l Ct 3 ^ ^ (1 2Sfn)' 

meM[n) 


(24) 
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TABLE II: Average number of visited CNs in S-DWBF algorithms 


Code 1 (M = 544); SNR = 4 dB 

Iteration 

S-DWBF-A 

S-DWBF-B 

10 

15.7 

137.2 

30 

10.1 

88.7 

50 

8.3 

76.4 

Code 2 (M = 1023); SNR = 3.4 dB 

Iteration 

S-DWBF-A 

S-DWBF-B 

5 

105.3 

1023 

10 

56.3 

1023 

20 

38.6 

1023 


This modification multiplies the seeond summation of ([ 8 ]) by a^, whieh is analogous to the 
Lagrange multiplier in (the ehecksum) eonstrained optimization and when 0:3 = 1, (l24l) degen¬ 
erates to dHl). The optimal 0:3 for (l24l) is elose to 1/17 for Code 2. Referring to Q, the IMWBF 
algorithm uses cti = 1.8 and the S-DWBF-A (B) decoder uses 0:2 = 0.33 (0.12) in (|2TI) . Due 
to the high VN/CN degrees of Code 2, almost all CNs are updated by S-DWBF-B algorithm 
after two or three iterations, yielding performance similar to that of the S-DWBF-F deeoder. The 
same figure shows that the S-DWBF-A decoder provides about 0.25 dB performanee gain with 
respect to the IMWBF deeoder at BER = 10“® and the S-DWBF-B algorithm offers additional 
0.1 dB gain. 

Table |n] presents the average number of CNs visited (in the associated ETFGs) by different 
sehedules for Code 1 and Code 2 at seleeted iterations. Although the S-DWBE-E algorithm has 
the best BER performanee among the single-bit algorithms, it requires higher computational 
complexity. By eontrast, the S-DWBE-A/B algorithms provide trade-offs between eomplexity 
and performanee. Eurthermore, for both seleetive weight-updating schedules, the number of 
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Fig. 5: Conditional FF distributions for the S-DWBF-F algorithm in decoding Code 1, SNR = 4 dB. 


visited CNs decreases when one proceeds with more iterations as the numbers of flipped bits 
and reliability-inverted VNs decreases. We plot the FF value distributions for the S-DWBF-F 
algorithm in Fig. [51 In contrast to Fig. 1, where the separation between the two conditional pdfs 
exhibits little variation, our DWBF algorithm is able to pull /(i?„|ifo) away from f{En\Hi) 
as the decoding process evolves. Since the reliability of a decoder’s decisions based on En 
depends on the separation (distance) between the two pdfs, the improved separation is certainly 
welcome. As mentioned before, we use = 0 in (l20l) for all S-DWBF algorithms. Although the 
optimal clipping threshold is unknown. Fig. [5] does convince us that 0 is a valid and convenient 
choice and the FF with the proposed dynamic checksum weighting does give a much better VN 
reliability reference. 

V. New Flipped-Bit Selection Methods and Multi-Bit BF Decoding Algorithms 

Multi-bit BF decoding algorithms were developed to accelerate the convergence performance. 
Most of these algorithms use the simple threshold comparison FBS rule ([TTI) discussed in 
Section jn] IfT^ . |[T3]| . |[T5ll - lfT^ . Even if the threshold A has been optimized by numerical 
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experiments, we find that it is neeessary to add the option B = {n\n = argmaxji?j} in ease 
B = % whose oeeurrenee probability during a deeoding proeess is nonzero. In faet, simulation 
results indieate that, depending on SNR and the deeoding/schedule algorithms used it ean be 
higher than 0.2 for Code 1. When this option is ineluded in FB deeision, the resulting FBS rule 
is called Algorithm [3] or the Ml-FBS rule. 

Algorithm 3 Flipped Bit Selection Rule 1 (Ml-FBS) 

Step 1 Find B = {n\En > A}. If S 7 ^ 0, stop; otherwise, proceed to Step 2. 

Step 2 Find B = {n\n = argmaxj Ei\ 


Recall that the PWBF algorithm uses the FF of the IMWBF algorithm as the VN reliability 
metric with the FB set (fT3l) determined by the FS count of (fT^ . That is, a CN sends a flip 
signal to its most unreliable linked VN only and the VNs which receive sufficient number of 
reliability warnings (flip signals) shall be flipped. It turns out that, with this extra filtering of 
VN-to-CN messages (E^s) and selective CN-to-VN flip signal passing, the PWBF algorithm 
is able to outperform the IMWBF decoder in both convergence rate and error rate ifTOll . This 
performance gain motivates us to ponder if a more elaborated flipping decision strategy that uses 
more information can bring about further performance improvement for the BF decoders using 
either the proposed FF (|2TI) or other FFs discussed in Section |nl 

A. Flipping Intensity 

Let En ^^rn(£j<A{n) Un, \m UTg maX^(z_y\/'(fn) Eji, and Ad (?T.) 

{m|m G Ad(n), Am = rr}- With these notations, we define the. flipping intensity (FI) of (received 
by) Vn as 



(UXm F'rn) T ^lSm[l 


(25) 


mGA4' {n) 
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where 9o > 9i >0 and 6{x) is the Kronecker delta function. For simplicity, both 9o and 6>i are 
confined to be integers so that FI is integer-valued. The above definition implicitly implies that 
= 0 if n) = 0 and only UCNs have a say in deciding FI. It also implies that a VN has a 
nonzero FI only if it has the largest FF value among Af{m) and if it is connected to the largest 
number of UCNs among its peers in AA(m), the associated FI should be even higher (6*o > 9i). 
In both cases, a UCN Cm will send a non-negative message to the VN with the highest FF value 
in the set Af{m). However, if Cm is a passed CN (PCN) = 0) and dc is small, it often implies 
that the tentative decisions of its linked VNs are all correct. Hence if the flipped bits are to be 
selected by checking whether the associated FI is greater than a threshold, vx^ should have a 
smaller probability of being chosen. This can be done by having the PCN send a drag message 
6 * 2 (sm — !)• But if there is doubt that Cm is connected to even incorrect bit decisions, the PCN 
has better not sending such a message. We decide that this is likely to be the case if Ux^ 7 ^ /im 
for this inequality means that at least one VN in has more connected UCNs than vx^- 

With UCNs and PCNs contributing opposite signals, we modify (1251) for all n, 0 < n < iV as 

Fn = [92{Sm — ^) F 9QSm\5{Uxm ~ ^^m) 9iSm\}- — d{Ux^ — (26) 

(n) 

where 02 < 0o is a nonnegative integer. On the other hand, when dc is large, it is less likely 
that Sm = 0 automatically implies correct decisions on all its linked bits and we thus stick to 
(1251) . having no PCN to contribute to FI. Although the thresholds 0/8 can be any nonnegative 
real numbers, to simplify implementation, we let 0/s be nonnegative integers such that the FI 
is integer-valued. 

B. Flipped-Bit Selection Rule 

A simple FI-based FBS rule is to flip the bits in the FB set B = {n\Fn > Api}. But the 
optimal threshold Api is not easy to determine especially for a code with low VN degree. 
A smaller threshold may cause incorrect flipping decisions while a large threshold tends to 
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slow down the convergence or even cause decoding failure as no VN meets the the flipping 
requirement. To overcome this dilemma, we select a relative high FI threshold and use the 
FB set B = {n\Fn > Api} if it is nonempty. Otherwise, B = {n\Un = maxi^rUi} where 
T = {n\Fn = maXjFj}. We summarize below the new FBS rule as Algorithm |4] which, for 
convenience of reference, is called the M2-FBS rule. 

Algorithm 4 Flipped Bit Selection Rule 2 (M2-FBS) 

Step 1 For n = 0,1,..., iV — 1, compute by (l25l) or (|2^ . 

Step 2 Find B = {n|F„ > Api}. If S 7 ^ 0, stop; otherwise, proceed to Step 3. 

Step 3 Update T and find B = {n\Un = maxUj}. 

ieT 


Note this FBS rule is independent of the FF and can be used in conjunction with different 
FFs no matter whether the checksum weights are constant or not. 

Loop-detection/breaking procedures can be included in our FBS algorithm if necessary. The 
loop detection scheme used lO is an appropriate choice. When a loop is detected, we generate 
a disturbance on the tentative decoded sequence by switching to the FB set 

B = {n\Un = maxUi}. (27) 


C. Numerical Results 

Different combinations of the FBS rule, the FF, and the weight-updating schedule, used lead 
to different decoding algorithms. The error-rate performance and decoding speed of various 
decoders are presented in this subsection. 

1) Abbreviations: For convenience of reference, we adopt a systematic labeling scheme similar 
to that used in Section IIV-BI to describe a decoding method. We denote a decoder by groups 
of capital letters separated by hyphens, specifying respectively the FBS rule, the FF and the 
weight-updating schedule used with the 3-field form, FBS rule-FF-weight updating schedule. 
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TABLE III: Simulation parameter settings 


Algorithm 

Code 1 

Code 2 

HGDBF 

as = 1 

as = 1/17 

AMWBF 

ai = 0.2 

ai = 1.8 

IPWBF 

ai = 0.2, Aps = 1 

Oil = 1-8, Aps = 10 

M2-IMWBF 

Oil = 0.2, Api = 5 

oi = 3.2, Api = 16 

M2-GDBF 

as = 1, Api = 1 

as = 1/17, Afi = 10 

Ml-DWBF-A 

a2 = 0.7, A = 0 

aa = 0.33, A = 0 

Ml-DWBF-B 

aa = 0.35, A = 0 

aa = 0.3, A = 0 

M2-DWBF-A 

aa = 0.58, Afi = 1 

02 = 0.33, Api = 4 

M2-DWBF-B 

a2 = 0.35, Api = 1 

aa = 0.3, Afi = 1 


That is, the first filed is used to indieate if single (S) or multiple (M) bits are to be flipped 
in an iteration and, for the latter case, if the simple FF based (Ml) or the more complicated 
FI-based (M2) FBS rule is adopted. The second field contains the abbreviation of the known or 
proposed algorithm such as IMWBF, GDBF or DWBF whose FF is used. The third field tells 
whether a selective (A or B) or the full (F) weight-updating schedule is used. Since only the 
DWBF algorithms need to update checksum weights, the third field is omitted for non-DWBF 
based decoders. Hence, Ml-DWBF-A represents the decoder that uses the Ml-FBS rule, the 
DWBF FF, and SWUS-A, and M2-IMWBF(-GDBF) denotes the decoder that uses the M2-FBS 
rule and the IMWBF (GDBF) algorithm’s FF. For known constant weight algorithms without 
FBS modification and SWUS, we keep conventional abbreviations like AMWBF, IPWBF, and 
HGDBF only. 

2) Parameter values used: For the decoders based on M2-FBS rule, we use the FI weights 
6*0 = 3, 6*1 = 2, and 6*2 = 1. Other major parameter values for different multi-bit BF algorithms 
are listed in Table Hill The remaining parameters needed for the IPWBF algorithm follow those 
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Fig. 6: BER performance of various multi-bit BF (M-BF) decoding algorithms as a function of SNR for Code 1. 


suggested in [fTTll . and those assoeiated with the eseaping proeess in the HGDBF algorithm are 
also optimized. For simplieity, the FF elipping threshold rj in (l20l) and the threshold A used in 
(fTTI) in Ml-DWBF-A/B algorithms are set to zero. 

Note that the parameters assoeiated with a deeoder are eorrelated, i.e., if the optimal value of 
a parameter is dependent on other parameters’ values used, although the eorrelation may not be 
very high. Henee, we try to jointly optimize these parameters to minimize the eonverged error 
rate. Furthermore, simulation results indieate that the optimal parameter values are insensitive 
to SNR. 

3) BER and FER performance: Fig. 0 shows the BER performanee of different multi-bit BF 
algorithms for Code 1 when /^ax = 50. The effeetiveness of the M2-FBS rule ean also be verified 
by eomparing the required Eh/No for BER = 10“^: the M2-IMWBF deeoder outperforms the 
IPWBF deeoder by approximately 1.7 dB and the M2-GDBF algorithm has a 0.4 dB gain over 
the HGDBF algorithm. The DWBF algorithms yields better BER performanee even with the 
simple Ml rule and, when the M2 rule is used, its performanee beeomes eloser (0.4 dB) to that 
provided by the NMS algorithm. The eonvergenee behaviors of these algorithms are shown in 
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Fig. 7: Frame error rate (FER) convergence performance of various M-BF decoding algorithms using conventional or M2-FBS 
rule; Code 1, SNR = 5 dB. 



Fig. 8: FER convergence performance of multi-bit DWBF (M-DWBF) decoding algorithms; Code 1, SNR = 3.25 dB. 


Figs. |7] and [8l The results show that the M2 rule gives better BER performanee and, for both 


the DWBF and M2-GDBF algorithms, the eonvergenee rate is improved as well. 


Note that in Figs. [MHl loop-deteeting/breaking sehemes are aetivated for all but the M2- 


DWBF-B algorithm. In general, loops are mueh less likely to oeeur in a DWBF deeoder than 
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Fig. 9: BER performance of various M-BF decoding algorithms as a function of SNR for Code 2. 


in a static CN weight decoder. When the FWUS or SWUS-B is used to decode Code 2, our 
simulations detect no loop for both codes whence there is no need for a loop breaker. This 
is because the time-varying checksum weights of the DWBF algorithm and wider message 
magnitude propagation ranges of the FWUS or SWUS-B schedule have made the BF decision 
related variables, En, Un, /fm, and to have much larger dynamic ranges; see also Figs. [U and 

El 

The BER (with /^ax = 20) and frame error rate (FER) convergence performance of various 
multi-bit BE decoders for Code 2 are respectively presented in Eigs.|9]and[T0l By comparing the 
two sets of BER curves, M2-GDBE versus HGDBE and M2-IMWBE versus AMWBE, we verify 
the effectiveness of the new EBS (M2) rule. Although the M2-IMWBE algorithm yields the same 
converged BER as that of the IPWBE decoder for this code, it gives better EER performance 
in the first few iterations. We further notice that the M2-DWBE-A (B) decoder is superior to 
the Ml-DWBE-A (B) decoder in both BER performance and decoding speed. The former yields 
performance very close to that of the NMS algorithm while the latter suffers only about 0.1 dB 
performance degradation against the NMS decoder at BER=10“^. Eigs. |9]and[in]also show that 
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Iteration Number 


Fig. 10: FER convergence performance of several M-BF decoding algorithms; Code 2, SNR = 3.4 dB. 


the performance gap between the M1/M2-DWBF-A and M1/M2-DWBF-B decoders is smaller 
than that for Code 1. This is due to the high VN/CN degrees of Code 2: the high VN degree 
increases the probability that a CN is visited by the SWUS-A while the high CN degree helps 
spreading the updated weights to more VNs. 

We want to remark that a) only the AMWBF and HGDBF decoders need a loop-breaker in 
decoding Code 2 since for the other decoders, loops are rarely detected and b) although the 
Ml-FBS rule is simpler, our simulations indicate that the M2-FBS rule can significantly reduce 
the probability of decoding loops when it is used in conjunction with the DWBF, GDBF, or 
IMWBF algorithms. This is particular useful when using conventional FFs to decode low-degree 
codes. 

D. Complexity Analysis 

Besides the syndrome computing, which is the same for all algorithms, the computational 
complexity of a BF decoding algorithm consists mainly of three parts: i) FF update, ii) flipped 
bits selection, and iii) weight/message update. Once new CN messages, —Wmn{^ — 2sm) in © 
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TABLE IV: Average UCN and visited CN numbers 


Code 1 (M = 544); SNR = 3.25 dB 

Iteration (0 

IPWBF 

M2-IMWBF 

M2-GDBF 

Ml-DWBF-A 

Ml-DWBF-B 

M2-DWBF-A 

M2-DWBF-B 

Af® 

M 

p(0 

^S3 

M 

Aiwu 

p(0 

^S2 

Ai\yy 

p(0 

^S2 

M 

Aiwu 

-'''fb 

M 

itiwu 

-''^FB 

5 

91.7 

544 

0.36 

544 

115.2 

3.6 X 10“^ 

386.7 

3.5 X 10“® 

544 

135.2 

34.1 

544 

506.6 

17.2 

to 

78.3 

0.52 

110.6 

0.0035 

162.8 

0.0014 

80.1 

19.1 

414.2 

8.8 

15 

71.0 

0.67 

107.7 

0.0092 

132.1 

0.0019 

69.0 

16.5 

338.1 

6.3 

20 

68.6 

0.65 

96.5 

0.0099 

131.3 

0.0012 

66.9 

15.8 

328.7 

5.9 

Code 2 (M = 1023); SNR = 3.4 dB 

Iteration (0 

IPWBF 

M2-IMWBF 

M2-GDBF 

Ml-DWBF-A 

Ml-DWBF-B 

M2-DWBF-A 

M2-DWBF-B 

Af™ 

Af™ 

A4<'> 

s./(0 

Aiwu 


A7<') 

A./(0 

-'^-‘wu.o 

-'''fb 

Af® 

-'^-'wu.o 

-''^FB 

3 

172.1 

217.2 

326.2 

267.3 

1023 

281.6 

382.4 

18.3 

332.4 

690.6 

24.6 

5 

198.3 

287.7 

351.9 

374.9 

1023 

318.5 

386.9 

26.1 

322.7 

700.3 

27.0 

to 

329.4 

360.2 

407.2 

588.7 

1023 

425.8 

415.4 

46.4 

436.5 

586.5 

50.7 

15 

353.6 

378.4 

414.6 

644.5 

1023 

438.1 

421.0 

48.2 

445.0 

577.0 

59.5 


TABLE V: Computational complexity for various decoding algorithms (Cl; Code 1, C2: Code 2) 


Operation 

HGDBF 

IPWBF 

M2-GDBF/IMWBF 

Ml-DWBF-A/B 

M2-DWBF-A/B 

NMS 

Integer Additions 

0 

A7f'> 

Cl: M 

C2: A/f'^ 

0 

Cl: M 

C2: 

0 

Real Number Additions 

0 

0 

0 

0 

0 


Integer Comparisons 

0 

N 

Cl (M2-IMWBF); M{d^ ~ 1) + N 

+ 

Cl (M2-GDBF): M{d^ - 1) -F W 

C2: Mf'’ (dc -1) + N 

0 

Cl: Af(dc -1)+N 

C2: M\^\dc-1) + N 

0 

Real Number Comparisons 

N 

Mf>(dc-1) 

Cl: M{d^ - 1) 

C2: - 1) 

Cl: - 3) 

+P®(JV-1) 

C2: - 3) 

Cl: A7(4 - 1) + Afi,'i(de - 2) 
+N^d„ 

C2: (Af« + A4«,„)(2dc - 3) 
+N^d„ 

M{2dc - 3) 


or —rmn(l — 2sm) in (l2T]) . are available, the FF update is just adding all returned CN messages 
and —ai\yn\ or ——2 m„). There is little differenee among the BF deeoders in FF computing. 
The only exception is that used by GDBF algorithms, both ([8]) and (|2^ require integer additions 
only, iii) is needed for DWBF algorithms but not other BF algorithms which require only a sign 
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TABLE VI: Averaged overall complexity (xlO®) per frame for achieving FER=10 ® (Real: real comparison or addition; Int: 
integer comparison or addition) 


Code 1 

SNR 

Ml-DWBF-A 

Ml-DWBF-B 

M2-DWBF-A 

M2-DWBF-B 

NMSA 

^max 

Real 

^max 

Real 

^max 

Real 

Int. 

^max 

Real 

Int. 

^max 

Real 

3.5 dB 

132 

19.1 

101 

46.4 

109 

41.7 

51.1 

47 

64.1 

54.4 

12 

44.2 

3.625 dB 

89 

15.3 

76 

41.7 

66 

36.2 

49.6 

35 

57.9 

49.5 

10 

41.6 

3.75 dB 

71 

12.5 

64 

38.1 

42 

32.1 

45.5 

28 

52.8 

45.4 

9 

39.2 

Code 2 

SNR 

Ml-DWBF-A 

Ml-DWBF-B 

M2-DWBF-A 

M2-DWBF-B 

NMSA 

^max 

Real 

^max 

Real 

^max 

Real 

Int. 

^max 

Real 

Int. 

^max 

Real 

3.4 dB 

49 

83.5 

46 

172.5 

35 

176.4 

66.7 

34 

269.1 

80.3 

22 

255.9 

3.6 dB 

11 

59.6 

8 

135.2 

8 

130.7 

50.4 

7 

210.7 

62.3 

6 

218.2 

3.7 dB 

7 

53.3 

6 

123.0 

6 

115.4 

44.9 

6 

189.3 

55.7 

5 

206.8 


change. Therefore, in the next three subseetions, we eonsider ii) first, followed by the diseussion 
of iii), the extra eomplexity requirement for DWBF algorithms, and finally compare the eombined 
computational complexity of ii) and iii). The additional complexity such as that associated with 
a loop-breaking seheme, is addressed at the end of this seetion as well. Sinee most algorithms, 
except those using the M2-FBS rule, whieh need additional integer operations and memory for 
storing the FIs and UCN numbers, require approximately the same storage spaee, we diseuss 
only the computational complexity. 

1) FBS complexity: The HGDBF deeoder needs only N real eomparisons in (fTTI) in seleeting 
the flipped bits. For the IPWBF deeoder, dc — 1 real comparisons are required to find the 
most unreliable eonneeted VN per UCN and a total of integer additions and N integer 
eomparisons are needed to eompute the FS and deeide the FB set (fT3l) . where is the 
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average UCN number in the /th iteration. 

For the Ml-FBS rule, the eomplexity of Step 1 is ignored sinee we use A = 0 and a threshold 
eomparison needs a sign-bit eheck only. The average eomplexity of Step 2 is P 52 (-^ “ 1 ) 
eomparisons, where P 52 Ihe probability that Step 2 is aetivated in the /th iteration. For the 
M2-FBS rule (Algorithm ID), Step 1 needs dc — 1 real and dc — 1 integer eomparisons per CN in 
finding Am and eheeking if vx^ has the most eonneeted UCNs. Eaeh CN has to send an integer¬ 
valued message, 6 * 0 , Oi, or — 6 * 2 , to one of its eonneeted VNs, implying an integer addition in (|2^ 
or (1251) . Sinee the former involves both UCNs and PCNs while the latter involves only UCNs, 
all M CNs have to perform all the above operations when deeoding Code 1 in eontrast to 
CNs for Code 2. Moreover, N integer eomparisons are required in Step 2 for deeiding the FB 
set. The average eomplexity of Step 3 is approximately equal to integer comparisons, 

where is the probability that Step 3 is activated at the /th iteration. 

2) Weight update complexity: Updating the weights associated with CN Cm in the Ml-DWBF- 
A/B decoders require 2dc — 3 real comparisons for finding the indices associated with the smallest 
and second smallest —E^s, n G For M2-DWBF-A/B decoders, however, most of Am’s 

have been found in the FBS step, hence only the second smallest ones remain to be found for 
computing new weights. A more detailed analysis is given in the next two paragraphs. 

We first consider a low CN degree code such as Code 1. After CNs compute Am’s and 
VNs compute their FIs via (|2^ . the flipped bits are decided and flipped. The M2-DWBF-A/B 
algorithms then invert the associated FF values (Step 3 of Algorithm O and update the CN 
weights (Step 4 of Algorithm O. As only a small portion of the visited CNs are connected to 
the flipped bits, most visited CNs require only dc — 2 real comparisons for finding the second 
smallest —En- For a visited CN that links to flipped bits, we only need to compare the connected 
flipped bits’ E^s with the original smallest —En to find a new minimum —En since only the 
flipped bits’ U„’s are changed between the bit flipping and weight updating. As a result, for a 
visited CN linking to t flipped bits, only additional t real comparisons are required for updating 
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the smallest —E^ before finding the second smallest —En. If we denote by and iVpg the 
average numbers of visited CNs and flipped bits at the /th iteration, respectively, we need, on 
the average, at most M^^{dc — 2) + real comparisons for updating weights, where N^^d^ 

accounts for the sum of all additional (t) comparisons. 

Decoding a high CN degree code requires 2dc — 3 real comparisons for updating the weight of 
a visited PCN and dc — 2 + t for an UCN, as the FI formula (1251) involves only UCNs. Denote the 
average numbers of visited PCNs and UCNs at the /th iteration by q and j. We observe 
from simulations that almost all UCNs are visited (i.e., Hence, M2-DWBF-A/B 

decoders require an average of M^\jQ{2dc — 3) and at most M[’'\dc — 2) + Np^dv real comparisons 
per iteration for computing new CN weights of visited PCNs and UCNs, respectively. For both 
cases we ignore the complexity of the threshold comparison in (fT9l) since rj = 0. 

3) Complexity Summary: Table HVl presents the simulated average numbers of m[''\ 

ivS, Ps 2 ^ and at selected iterations for the IPWBF, M2-IMWBF/GDBF, Ml-DWBF- 
A/B, and M2-DWBF-A/B algorithms. Since the simulation results indicate that when decoding 
Code 2 with the Ml-DWBF-A/B decoders. Step 2 of the Ml-FBS rule is never activated, we 
list the values for Code 1 only. Similarly, Step 3 of the M2-FBS rule is needed only if the 
M2-IMWBF algorithm is used to decode Code 1, we thus specify the P^l values for this case 
only. 

Considering both the FBS rules and weight updating, we summarize the computational com¬ 
plexity which includes real/integer additions and comparisons per iteration for various BF and 
the NMS algorithms in Table |Vl As computing the total LLRs in the NMS algorithm requires 
the same efforts as that of computing FF values in BF decoders, only the efforts needed for 
computing the CN-to-VN and VN-to-CN messages are listed in the table. 

Table rvTI presents the simulated average complexity of various DWBF and the NMS algorithms 
for decoding a frame with a target FER of 10“^ at different SNRs. As an integer (or real) 
comparison requires about the same computational complexity as that of an integer (or real) 
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addition (hardware implementation of eomparison ean even be simpler than addition). Both are 
thus eounted equally. We show in this table the average integer and real operations and the 
maximum iteration number (/max) needed. 

Tables IV-VI and Figs. [6l-fT0l provide useful information for studying tradeoffs between perfor- 
manee, complexity, and convergence rate when combining different FBS rules, FFs and weight¬ 
updating schedules. In particular. Table IVll shows that when decoding Code 1, both Ml-DWBF- 
A and Ml-DWBF-B algorithms need less computational complexity to achieve FER=10“^ than 
that needed by the NMS algorithm in higher SNR (say, > 3.625 dB) region. The M2-DWBF-A 
algorithm also needs less real operations in comparison with the NMS algorithm. For decoding 
Code 2, the Ml-DWBF-A, Ml-DWBF-B, and M2-DWBF-A algorithms need less total (real -i- 
integer) operations to achieve the FER requirement while the required iteration numbers are also 
comparable to that needed for the NMS algorithm in higher SNR region. Furthermore, based on 
Tables |IVl |Vl and Fig. we conclude that the M2-IMWBF and M2-GDBF algorithms require 
far less complexity than that of the IPWBF algorithm in decoding Code 1. 

Among the the decoding algorithms compared in Table IVl the IPWBF algorithm uses a simpler 
FBS operation but it has to perform a delay-handling process in every iteration plus an initial 
bootstrapping step. These two extra operations need off-line computing effort in searching for the 
corresponding optimal parameter values. They also require additional storage and computational 
complexity. Although the HGDBF algorithm does not have to sort the FF values, three real 
thresholds, one for the multi-bit flipping mode and two for the escape (loop-breaking) process 
are required in its FBS rule, resulting extra off-line search and random variable generations. Other 
off-line efforts include the searches for ai (M2-IMWBF), 02 (Ml- and M2-DWBF-A/B), and 
Api (e [—dvd 2 , dy9o]). As mentioned before, they must be jointly optimized. For the M2-GDBF 
algorithm, only the optimal Api has to be found. 

Our loop-breaking scheme (ITTl) is simpler than those used by other decoding algorithm and 
more effective than the methods used by the IPWBF and AMWBF algorithms which remove 
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the bit(s) having maximum or from when S = 0, the deeoding proeess will be foreed 
to terminate after the removal. Instead of redueing \B\, the eseape proeess (1271) and that used 
by the HGDBF algorithm perturb the tentative decoded sequence to break a loop. The latter, 
however, has to generate Gaussian random variables. 

VI. Conclusion 

We divide a typical BF LDPC code decoding algorithm into three major components, namely 
1) VN decision reliability (FF) computing and the associated CN reliability (checksum weight) 
update formula; 2) the FBS rule; and 3) the checksum weight-updating schedule. These three 
components determine the performance and complexity of a BF decoder. We develop novel 
FF and FBS rules to improve the BF decoding performance. On the other hand, the checksum 
weight update operation is a complexity concern for the DWBF decoders, we propose selective 
weight-updating schedules to reduce the implementation complexity with little performance loss. 

Different combinations of FF, checksum weight-updating method and schedule, FBS rule, 
result in different decoder structures. We simulate the error rate and convergence performance 
of various decoders, and the resulting numerical behaviors confirm the effectiveness of our new 
design proposals. We show that the combinations of the new multi-bit FBS rules with known 
BF algorithms achieve significant performance gain especially for a high-rate code. Detailed 
complexity analysis on various decoder structures is provided for complexity and performance 
tradeoff studies. We find that, compared with the NMS algorithm, the combination of the new 
FBS rules with our DWBF algorithms require less complexity in achieving a target FER if SNR 
is sufficiently high. We also find that the convergence rates are comparable when decoding a 
high-rate code. 
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